Picture for Xin Li

Xin Li

College of Business, City University of Hong Kong, Hong Kong, China

Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields

Add code
Apr 30, 2025
Viaarxiv icon

Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency

Add code
Apr 29, 2025
Viaarxiv icon

Muyan-TTS: A Trainable Text-to-Speech Model Optimized for Podcast Scenarios with a $50K Budget

Add code
Apr 27, 2025
Viaarxiv icon

A BERT-Style Self-Supervised Learning CNN for Disease Identification from Retinal Images

Add code
Apr 25, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: KwaiSR Dataset and Study

Add code
Apr 21, 2025
Viaarxiv icon

Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection

Add code
Apr 20, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Add code
Apr 19, 2025
Viaarxiv icon

From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs

Add code
Apr 18, 2025
Viaarxiv icon

seeBias: A Comprehensive Tool for Assessing and Visualizing AI Fairness

Add code
Apr 11, 2025
Viaarxiv icon

OmniCaptioner: One Captioner to Rule Them All

Add code
Apr 09, 2025
Viaarxiv icon